19 research outputs found

Move Forward and Tell: A Progressive Generator of Video Descriptions

Author: A Farhadi
A Geiger
A Rohrbach
G Kulkarni
J Steinberger
L Wang
M Regneri
Publication venue
Publication date: 26/07/2018
Field of study

We present an efficient framework that can generate a coherent paragraph to describe a given video. Previous works on video captioning usually focus on video clips. They typically treat an entire video as a whole and generate the caption conditioned on a single embedding. In contrast, we consider videos with rich temporal structures and aim to generate paragraph descriptions that preserve the story flow while being coherent and concise. Towards this goal, we propose a new approach, which produces a descriptive paragraph by assembling temporally localized descriptions. Given a video, it selects a sequence of distinctive clips and generates sentences on them in a coherent manner. In particular, the selection of clips and the production of sentences are done jointly and progressively, driven by a recurrent network: what to describe next depends on what has been said before. Here, the recurrent network is learned via self-critical sequence training with both sentence-level and paragraph-level rewards. On the ActivityNet Captions dataset, our method demonstrated the capability of generating high-quality paragraph descriptions for videos. Compared to those produced by other methods, the descriptions produced by our method are often more relevant, more coherent, and more concise. Comment: Accepted by ECCV 201

arXiv.org e-Print Archive

Crossref

Évaluation des caractéristiques d'un réseau d'assainissement unitaire par la génération de rejets discontinus de déversoirs d'orage

Author: Klepiszewski K.
Regneri M.
Schutz G.
Publication venue: GRAIE, Lyon, France (FRA)
Publication date: 01/01/2016
Field of study

I-Revues

Movie Description

Author: A Kojima
Aaron Courville
Anna Rohrbach
Atousa Torabi
Bernt Schiele
C Fellbaum
Christopher Pal
H Wang
Hugo Larochelle
M Regneri
Marcus Rohrbach
Niket Tandon
O Russakovsky
P Young
R Kiros
R Socher
S Hochreiter
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Underspecified modelling of complex discourse constraints

Author: Egg M.
Regneri M.
Publication venue
Publication date: 01/01/2008
Field of study

Efficient processing of underspecified discourse representations

Author: Egg M.
Koller A.
Regneri M.
Publication venue
Publication date: 01/01/2008
Field of study

Dissertations of the University of Groningen

Efficient processing of underspecified discourse representations

Author: Egg M.
Koller A.
Regneri M.
Publication venue
Publication date: 01/01/2008
Field of study

Underspecification-based algorithms for processing partially disambiguated discourse structure must cope with extremely high numbers of readings. Based on previous work on dominance graphs and weighted tree grammars, we provide the first possibility for computing an underspecified discourse description and a best discourse representation efficiently enough to process even the longest discourses in the RST Discourse Treebank.

CiteSeerX

Crossref

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Script Data for Attribute-based Recognition of Composite Activities

Author: Amin S.
Andriluka M.
Pinkal M.
Regneri M.
Rohrbach M.
Schiele B.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

MPG.PuRe

Coherent Multi-sentence Video Description with Variable Level of Detail

Author: A Farhadi
A Kojima
H Wang
I Zukerman
M Regneri
M Rohrbach
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Humans can easily describe what they see in a coherent way and at varying levels of detail. However, existing approaches for automatic video description focus on generating only single sentences and are not able to vary the descriptions’ level of detail. In this paper, we address both of these limitations: for a variable level of detail we produce coherent multi-sentence descriptions of complex videos. To understand the difference between detailed and short descriptions, we collect and analyze a video description corpus with three levels of detail. We follow a two-step approach where we first learn to predict a semantic representation (SR) from video and then generate natural language descriptions from it. For our multi-sentence descriptions we model across-sentence consistency at the level of the SR by enforcing a consistent topic. Human judges rate our descriptions as more readable, correct, and relevant than those of related work.

OPUS Augsburg

Crossref

CISPA – Helmholtz-Zentrum für Informationssicherheit

MPG.PuRe

Optimization methods applied to stormwater management problems: a review

Author: Dajani J. S.
Fiorelli D.
Froise S.
Fuchs L.
Geneviève Pelletier
Huff F. A.
Marinaki M.
Ministere de Developpement durable de l’Environnement et des Parcs (MDDEP)
Minnesota Stormwater Steering Committee
Pleau M.
Regneri M.
Saber-Freedman N.
Shadab Shishegar
Shrestha D. L.
Solvers F.
Sophie Duchesne
Tobergte D. R.
Watkins D.W.
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref

Ask Your Neurons: A Deep Learning Approach to Visual Question Answering

Author: CD Manning
J Cohen
J Krishnamurthy
JL Fleiss
M Regneri
M Ren
Marcus Rohrbach
Mario Fritz
Mateusz Malinowski
P Liang
S Hochreiter
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref